Accelerating High-Dimensional Nearest Neighbors for Video Search

نویسندگان

  • Carlo C. del Mundo
  • Alexa Yao
چکیده

The k-nearest neighbor algorithm (kNN) is a critical algorithm used extensively in fields such as Computer Vision, Robotics, and Machine Learning. In this work, we address the performance of FLANN, a popular kNN library, at the node-level by co-designing indexing and search algorithms with software support. We characterize, profile, and optimize FLANN for high-dimensionality (e.g., ≥ 4096) for two systems: an enterprise-grade Intel E5520 CPU and a lowpower ARM Cortex A15 CPU. Our profiling suggests that the priority queue k-means kNN algorithm is the best candidate for software acceleration as 98.8% of indexing and 97.5% of search time is spent on calculating the Euclidean distance between two vectors. We implemented vectorization with Intel SSE and ARM Neon extensions and multithreading with OpenMP. Overall, our software optimizations improve search and indexing by factors of 2.27-fold and 5.16-fold, respectively, for the Intel CPU and 1.45-fold and 4.07-fold, respectively, for the ARM CPU. Finally, we projected the performance in offloading the computations to an on-board GPU in the ARM platform. Our projection suggests that even if GPU computation took zero time, data movement in a unified CPU/GPU SoC platform would still degrade performance by a factor of 9.90-fold compared to the optimized Intel CPU version.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dimensionality Reduction for Image Retrieval

Dimensionality reduction methods are of interest in applications such as content based image and video retrieval. In large multimedia databases, it may not be practical to search through the entire database in order to retrieve the nearest neighbors of a query. Good data structures for similarity search and indexing are needed, and the existing data structures do not scale well for the high dim...

متن کامل

Fast reciprocal nearest neighbors clustering

This paper presents a novel approach for accelerating the popular Reciprocal Nearest Neighbors (RNN) clustering algorithm, i.e. the fast-RNN. We speed up the nearest neighbor chains construction via a novel dynamic slicing strategy for the projection search paradigm. We detail an efficient implementation of the clustering algorithm along with a novel data structure, and present extensive experi...

متن کامل

Towards Meaningful High-Dimensional Nearest Neighbor Search by Human-Computer Interaction

Nearest Neighbor search is an important and widely used problem in a number of important application domains. In many of these domains, the dimensionality of the data representation is often very high. Recent theoretical results have shown that the concept of proximity or nearest neighbors may not be very meaningful for the high dimensional case. Therefore, it is often a complex problem to find...

متن کامل

A Novel Hybrid Approach for Email Spam Detection based on Scatter Search Algorithm and K-Nearest Neighbors

Because cyberspace and Internet predominate in the life of users, in addition to business opportunities and time reductions, threats like information theft, penetration into systems, etc. are included in the field of hardware and software. Security is the top priority to prevent a cyber-attack that users should initially be detecting the type of attacks because virtual environments are not moni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014